AD-A077  707  MARYLAND  UNIV  BALTIMORE  COUNTY  BALTIMORE  OEPT  OF  MAT— FTC  F/G 
DETECTION*  ESTIMATION,  AND  CONTROL  ON  GROUP  MANIFOLDS. (U) 

MAY  79  J  T  LO  AF0SR-74-?671 

UNCLASSIFIED _ AFOSR-TR-79-1093  NL 

I  OF  | 

AD 

A077701 


ADA07770? 


Final 


Detection,  Estimation,  and  Control 


on  Group  Manifolds 


Air  Force  Office  of  Scientific  Research 
Air  Force  Systems  Command 
USAF 


FROM 


James 


Department  of  Mathematics 
University  of/ Maryland  Baltimore  County 
Baltimore,  Maryland  21228 


SUBJECT 


Detection,  Estimation,  and  Control 
on  Group  Manifolds 


covering  the  period 
June  1,  1974  May  31, 


Approved  for  public  release 
distribution  unlimited. 


TABLE  OF  CONTE 


II.  Publications 


III.  Summary  of  Progress 


(1)  Global  Bilinearization  of  Systems  with  Control 
Appearing  Linearly 


(2)  Eigenvector  Placement  in  Linear  Multivariable 
Systems 


(3)  Signal  Detection  for  Bilinear  Systems 


(4)  Exponential  Fourier  Densities  and  Estimation  and 
Detection  on  a  Circle 


Estimation  for  Discrete-Time  Directional  Processes 


(6)  Exponential  Fourier  Densities  on  S  and  Optimal 

Estimation  and  Detection  of  Directional  Processes 


Exponential  Fourier  Densities  on  S0(3)  and  Optimal 
Estimation  and  Detection  of  Rotational  Processes 


(8)  Exponential  Fourier  Densities  and  Optimal  Estimation 
for  Axial  Processes 


(9)  Estimation  and  Detection  on  Lie  Groups 


(10)  Estimation  Problems  with  Lie  Group  Structure 


(11)  Optimal  Filters  for  Nilpotent  Associate  Algebraic 
Bilinear  Systems 


(12)  Optimal  Filters  for  Bilinear  Systems  with  Nilpotent 
Lie  Algebras 


(13)  Optimal  Estimation  for  the  Satellite  Attitude  Using 
Star  Tracker  Measurements 


Convolution  of  Exponential  Fourier  Densities  and 
Filtering  on  the  Circle 


I.  General 


This  final  report  covers  work  carried  out  by  faculty  and  staff  members  of 
the  Department  of  Mathematics  during  the  five-year  period  1  June  1974  through 
31  May  1979  under  Grant  No.  AFOSR-74-2671. 

Progress  was  mainly  contained  in  the  technical  papers  and  reports  listed 
in  Section  11.  An  interim  report  was  submitted  to  AFOSR  at  the  end  of  each  of 
the  first  four  years. 

During  the  report  period,  the  following  people  contributed  to  the  project: 
Associate  Professor  James  T.  Lo,  Dr.  Linda  R.  Eshleman,  Dr.  Masahiro  Nishihama, 
and  Dr.  Shirish  Chikte. 
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Ill .  Summary  of  Progress 

The  results  covered  by  this  final  report  can  be  put  in  two  categories  - 
continuous-time  systems  and  discrete-time  systems.  The  cont inuous-t ime 
systems  considered  are  bilinear  in  form.  The  bilinear  systems  have  been 
extensively  studied  in  recent  years  for  three  primary  reasons.  First ,  it  has 
been  shown  that  bilinear  systems  are  feasible  mathematical  models  for  large 
classes  of  problems  of  practical  Importance.  Second,  bilinear  systems 
provide  higher  order  approximations  to  nonlinear  systems  than  do  linear  systems. 
Third,  bilinear  systems  have  rich  geometric  and  algebraic  structures  which 
promise  a  fruitful  field  of  research. 

One  of  our  main  results  gave  a  necessary  and  sufficient  condition  that  a 
nonlinear  system,  with  control  appearing  linearly,  be  dynamically  equivalent 
to  an  observable  bilinear  system.  When  the  condition  is  satisfied,  a 
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procedure  to  construct  an  observability  canonical  form  of  such  a  bilinear 
system  was  provided. 

Many  interesting  detection  problems  including  the  demodulation  for  a 
frequency-shift-keyed  binary  communicat ion  system  involve  bilinear  systems. 
Such  problems  were  studied  by  us  from  the  proposed  group-manifold  viewpoint. 

A  likelihood  ratio  formula  was  derived  as  a  function  of  up-dated  observation. 

A  least -squares  filtering  problem  in  which  the  signal  process  to  be 
estimated  is  generated  by  a  nilpotent  bilinear  system  was  studied.  It  was 
found  that  the  filtering  equations  are  also  bilinear  in  form  and  possesses 
the  nilpotency  property.  By  means  of  an  illustrative  example,  it  was  shown 
how  the  dimensionality  of  the  filter  may  be  reduced  by  eliminating  certain 
inherent  redundancies. 

Perhaps,  the  most  important  contribution  made  by  us  during  the  report 
period  is  the  discovery  and  development  of  the  exponential  Fourier  densities 
on  Lie  groups  for  the  discrete-time  systems.  The  EFD's  were  found  to  have 
three  desirable  properties  that  provide  us  with  a  firm  grip  on  discrete¬ 
time  detection  and  estimation  problems  on  group  manifolds.  First,  any 
continuous  or  bounded-var iat ion  density  on  a  compact  Lie  group  can  be 
approximated  as  closely  as  desired  by  an  EFD.  Second,  the  EFD's  of  a  given 
order  are  closed  under  the  operation  of  taking  conditional  distributions. 
Third,  they  are  "almost"  closed  under  convolution. 

All  these  properties  and  their  applications  to  estimation  and  detection 
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on  S  ,  S0(3),  S  ,  S  /±1,  and  general  compact  Lie  groups  were  reported  in  most 
of  the  publications  listed  in  Section  II,  which  reflect  that  about  6*>7  of  our 
research  effort  supported  by  the  Grant  was  made  to  develop  the  full  strength 
of  such  densities.  Above  all,  a  practical  application  of  the  EFD's  to  the 
satellite  attitude  determination  using  star  tracker  measurements  was  carried 
out  during  the  report  period.  The  application  represents  a  new  approach  to 
spacecract  attitude  determination  and,  perhaps,  the  first  optimal  nonlinear 
filter  ever  implemented  for  a  real-world  system. 

We  will,  in  the  following,  summarize  the  ideas  and  results  of  each  of  the 
publications. 


(1)  Global  Bilinearization  of  Systems  with  Control  Appearing  Linearly 

Bilinear  systems  have  been  extensively  studied  in  recent  years  for  three 
primary  reasons.  First,  it  has  been  shown  that  bilinear  systems  are  feasible 


mathematical  models  for  large  classes  of  problems  of  practical  importance. 
Second,  bilinear  systems  provide  higher  order  approximations  to  nonlinear 
systems  than  do  linear  systems.  (Linear  systems  are  special  cases  of  bilinear 
systems.)  Third,  bilinear  systems  have  rich  geometric  and  algebraic  structures 
which  promise  a  fruitful  field  of  research. 

Perhaps  this  paper  can  provide  the  fourth  reason  for  studying  bilinear 
systems.  It  is  shown  in  the  paper  that  a  large  class  of  control  systems 
with  highly  nonlinear  features  can  be  t  rans formed  into  bilinear  svstems.  A 
necessary  and  sufficient  condition  that  a  nonlinear  system,  with  control 
appearing  linearly,  be  dynamically  equivalent  to  an  observable  bilinear 
system  is  derived.  When  the  condition  is  satisfied,  a  procedure  to  construct 
an  observability  canonical  form  of  such  a  bilinear  system  is  provided  in  the 
proof  of  the  sufficiency  part  of  the  condition. 

A  precise  statement  of  the  main  result  is  given  as  a  theorem  on  page  881 
of  the  paper. 

(2)  Eigenvector  Placement  in  Linear  Multivariable  Systems  -- 

A  very  important  system  design  problem  may  best  be  explained  by  the 
following  hypothetical  example  of  Jet  engine  control:  The  control  mechanism 
of  a  Jet  plane,  which  is  composed  of  an  electronic  analog  computer  (or  a 
digital  computer)  and  a  mechanical  actuator,  should  be  designed  to  meet  two 
main  goals.  First,  the  mechanism  should  be  able  to  shift  every  relevant 
component  of  the  engine  and  the  airplane  body  into  its  best  operation  level 
for  each  possible  maneuver.  Second,  the  action  required  of  the  pilot  to 
accomplish  each  maneuver  should  be  made  as  simple  as  possible.  As  many 
feats  can  be  done  by  linear  autonomous  systems,  it  is  not  too  restrictive  to 
assume  that  such  a  system  can  be  used  to  accomplish  the  first  goal.  However, 
feeding  input  appropriately  may  be  too  large  a  Job  for  the  pilot,  particularly 
if  there  are  a  large  number  of  input  terminals.  Therefore,  how  to  reduce 
the  external  control  terminals  that  are  directly  maneuvered  by  the  pilot  with¬ 
out  affecting  the  airplane  performance  is  a  crucial  design  problem. 

This  is  exactly  the  kind  of  problem  studied  in  this  paper.  A  necessary 
and  sufficient  condition  is  given  for  reducing  the  number  of  external  input 
terminals  without  affecting  the  controllability  property  of  the  linear 
system.  When  this  condition  is  satisfied,  an  output  feedback  can  be  computed 
by  an  algorithm  given  in  the  paper.  This  output  feedback  gain  shows  how  to 


hook  up  the  output  and  the  undeslred  Input  terminals,  and  thus  reduce  the 
operating  burden  of  the  pilot. 

Mathematically  speaking,  this  paper  presents  a  necessary  and  sufficient 
condition  for  the  existence  of  a  linear  output  feedback  law  which  shifts 
the  system  eigenvectors  out  of  an  arbitrary  linear  subspace.  When  the 
condition  is  satisfied,  a  procedure  to  construct  such  an  output  feedback 
law  is  given  in  the  proof  of  the  condition. 

(3)  Signal  Detection  for  Bilinear  Systems  -- 

Many  Interesting  detection  problems  Including  the  demodulation  for  a 
frequency-shift-keyed  binary  communication  system  involve  bilinear  systems. 
This  paper  studies  such  problems  from  the  proposed  group-manifold  viewpoint. 

The  injection  operator  used  by  McKean  to  construct  Brownian  paths  on 
a  Lie  group  is  employed  to  formulate  a  class  of  signal  detection  problems  on 
matrix  Lie  groups.  The  hypotheses  that  the  signal  is  absent  and  present  in 
the  observation  on  a  Lie  group  are  described  by  a  pair  of  bilinear  matrix 
Ito  equations.  The  injection  operator  Is  shown  to  be  almost  surely 
bijective  and  its  inverse  is  constructed.  This  bijection  enables  us  to  do 
detection  and  estimation  of  processes  taking  values  on  a  Lie  group  in  a 
finite  dimensional  linear  space--the  Lie  algebra.  In  fact,  bilinear  detection 
on  Lie  groups  ("locally  linear  spaces")  is  in  essence  equivalent  to  linear 
detection  on  linear  spaces.  A  likelihood  ratio  formula  for  bilinear  matrix 
Ito  equations  is  thus  derived  as  a  function  of  up-dated  observation. 

(4)  Exponential  Fourier  Densities  and  Estimation  and  Detection  on  a  Circle 

In  a  series  of  recent  papers,  the  Fourier  series  is  used  to  represent 

probability  densities  in  studying  estimation  on  the  circle,  S*  -  [-11,  II}. 

Many  suboptimal  estimation  schemes  have  been  obtained  by  truncating  the  higher 
frequency  terms  as  necessitated  in  any  numerical  equation.  However,  simula¬ 
tion  results  indicate  that  the  approach  is  not  fully  satisfactory. 

One  intuitive  explanation  is  that  the  higher  order  harmonic  terms  need 

not  be  negligible,  as  evidenced  by  the  formal  Fourier  expansion,  p  (x)  - 

+  cob  n  (x  -  x0),of  the  unit  mass  density  at  x  ■  x0.  As  a  matter 

of  fact,  applying  the  Bayes  rule  to  calculate  a  conditional  density  always 
involves  the  multiplication  of  two  a  priori  densities,  and  the  multiplication 
of  two  Fourier  series  has  the  effect  of  spreading  the  dominent  Fourier 
coefficients  into  higher  frequency  terms.  Consequently,  the  approach 
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places  severe  limitations  on  the  performance  of  the  suboptimal  schemes 
obtained  from  truncating  the  higher  frequency  terms.  Obviously,  the  dilemma 
1b  especially  serious  in  multistage  estimation  when  a  sequence  of  multi¬ 
plications  of  Fourier  series  takes  place. 

The  purpose  of  the  paper  is  to  propose  an  alternative  approach  to  avoid 

this  difficulty.  The  underlying  idea  of  the  approach  is  very  simple.  A 

probability  density  function  of  can  be  viewed  as  a  periodic  function  on 

the  real  line.  As  the  function  is  nonnegative,  it  can  be  shown  that  the 

function  can  be  approximated  as  closely  (to  be  specified  later)  as  we  wish 

by  a  function  in  the  form, 

n 

exp  (A,  cos  kx  +•  b,  sin  kx) 

k=0  k  k 

which  we  will  call  an  exponential  Fourier  density  of  the  n-th  order.  It  is 
obvious  that  the  multiplication  of  two  such  densities  does  not  raise  the  order 
of  the  densities.  Thus  the  class  of  all  n-th  order  Fourier  densities  is 
closed  under  the  operation  of  taking  conditional  distributions.  We  recall 
that  a  property  of  Gaussian  densities  responsible  for  the  simple  K-B  filtering 
is  exactly  such  a  closure  property.  Therefore,  it  is  not  a  surprise  that  the 
use  of  exponential  Fourier  densities  leads  to  simple  resolutions  of  many 
estimation  problems  on  the  circle. 

In  this  paper  a  basic  theorem  for  approximating  a  probability  density 
with  an  exponential  Fourier  density  is  given.  Three  models  are  then  formulated 
and  studied  which  are  believed  to  be  generic  in  practical  problems  of  detection 
and  estimation.  Optimal  recursive  schemes  are  derived  for  these  models, 
thereby  illustrating  how  the  closure  property  mentioned  above  facilitates 
finite-dimensional,  closed-form,  and  exact  solution. 

(5)  Estimation  for  Discrete-Time  Directional  Processes  -- 

In  many  problems  of  practical  importance,  we  are  concerned  with  three 
dimensional  directions.  A  notable  engineering  example  of  this  kind  is  the 
earth-pointing  satellite  attitude  determination  and  control.  Some  other 
examples  can  be  found  in  orientation  analysis  in  biology,  dip  and  declination 
study  in  geology,  seasonal  fluctuation  phenomena  in  medicine  and  wind  direction 
forecast  in  meteorology. 

It  is  not  pretended  that  in  this  paper  we  resolve  any  specific  estimation 
problems  Involved  in  these  practical  subjects.  The  purpose  of  this  paper 


is  rather  to  propose  a  simple  mathematical  model  which  is  believed  to  be 
generic  in  many  such  situations  involving  directional  data,  and,  by  detailed 
analysis  of  the  model,  to  illustrate  the  structures  of  multistage  estimation 
for  directional  processes.  We  call  a  process,  whose  state  space  is  a  circle 
a  sphere,  or  a  hvpersphere,  a  directional  process. 

In  this  paper  we  are  mainly  concerned  with  three-dimensional  directional 
processes,  processes  on  a  sphere.  Estimation  for  higher  dimensional  direc¬ 
tional  processes  can  be  Lreated  similarly.  Estimation  on  a  circle  is  substan¬ 
tially  simpler.  In  fact  the  approach  used  here  is  an  (nontrivial)  extension 
of  that  used  in  (4). 

Error  criteria,  probability  distributions,  and  optimal  estimates  on  a 
sphere  are  discussed.  Exponential  Fourier  densities  on  a  circle  introduced 
in  (4)  are  generalized  to  those  on  a  sphere.  The  underlying  idea  for  using 
the  exponential  Fourier  densities  is  the  same  as  that  on  the  circle  in  (4). 
Namely,  the  closure  property  of  these  densities  under  the  operation  of  taking 
conditional  densities  enables  us  to  compute  the  optimal  estimates  efficiently 
by  updating  only  a  fixed  and  finite  number  of  parameters.  In  contrast  to  the 
circle  case,  this  closure  property  does  not  exist  for  all  combinations  of 
signal  and  noise  densities  on  the  sphere.  As  a  matter  of  fact,  a  key  issue 
addressed  in  this  paper  is  to  characterize  such  combinations  of  signal  and 
noise  densities  that  have  this  closure  property. 
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(6)  Exponential  Fourier  Densities  on  S  and  Optimal  Estimation  and  Detection 
of  Directional  Processes  -- 

2 

Optimal  estimation  on  S  ,  the  unit  sphere,  was  first  studied  in  [5 1 
using  exponential  trigonometric  densities  (ETD's).  The  results  in  ['’l 
have  two  distinct  deficiencies:  (1)  The  class  of  admissable  noise  densities  is 
restricted.  (2)  The  signal  must  be  a  constant  direction.  It  is  these  defi- 
ciences  that  motivated  the  continuing  effort  which  results  in  the  present 
paper. 

The  first  deficiency  led  to  the  use  of  the  class  of  exponential  Fourier 
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densities  (EFD's)  on  S  .  Any  continuous  density  can  be  approximated  as  closely 
AN  desired  by  an  EFD. '  Furthermore,  the  EFD’s  are-all  admissable  noise  densities. 

The  second  deficiency  led  to  the  use  of  three  new  kinds  of  displacement  on 
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S  .  Each  kind  can  be  used  to  construct  one  time-varying  signal  process  and  one 
observation  process.  Various  combinations  of  the  signal  processes  and  the  obser- 


vat  ion  processes  yield  16  different  estimation  models.  The  additive  observation 
noise  case  is  also  considered.  The  merits  and  disadvantages  of  each  model  are 
discussed  in  detail,  when  recursive  formulas  are  derived  for  the  multistage 
conditional  densities  for  many  of  the  models. 

In  general,  neither  the  class  of  ETD's  nor  the  class  of  EFD's  is  closed 
under  the  operation  of  taking  conditional  distributions  with  respect  to  each  of 
the  models.  An  alternative  usage  of  the  classes  is  required  to  derive  each  of 
the  recursive  schemes  for  optimal  estimation. 

Signal  detection  problems  for  the  models  are  also  studied.  Recursive  algo¬ 
rithms  for  the  likelihood  ratios  are  obtained  for  many  of  the  models. 

In  short,  this  paper  gives  a  comprehensive  and  comparative  treatment  of 
many  estimation  and  detection  models  on  the  sphere.  We  believe  that  it  provides 
a  chest  of  tools  which  will  be  useful  in  analyzing  and  synthesizing  practical 
estimation  and  detection  problems. 

(7)  Exponential  Fourier  Densities  on  S0(3)  and  Optimal  Estimation  and  Detec¬ 
tion  of  Rotational  Processes 

Rigid  body  rotations  are  involved  in  many  important  practical  problems  of 
detection,  estimation,  and  control.  Some  notable  examples  can  be  found  in 
gyroscopic  analysis  and  satellite  attitude  determination  and  control.  While 
linearization  and  approximation  techniques  have  led  to  many  useful  results, 
simple  analytic  tools  which  will  enable  us  to  analyze  and  synthesize  the  optimal 
structures  have  long  been  desired. 

Optimal  estimation  and  detection  schemes  for  discrete-time  processes  whose 
state  space  is  a  circle  or  sphere  have  been  obtained  in  (!*)  and  (g)  by  using  a 
novel  representation  for  probability  densities  which  has  the  form  exp  f  where  f 
is  a  finite  linear  combination  of  functions  which  form  a  complete  orthogonal  sys¬ 
tem  on  the  state  space  involved.  In  the  case  of  the  circle,  circular  functions 
were  used,  while  both  spherical  harmonics  and  multiple  trigonometric  functions 
were  employed  for  densities  defined  on  the  sphere. 

In  this  paper  the  same  approach  will  be  taken  for  discrete-time  rotation¬ 
al  processes  by  introducing  a  similar  exponential  density  referred  to  as  a 
rotational  exponential  Fourier  density  (REFD)  defined  on  the  group  of  rota¬ 
tions  of  three-dimensional  space,  that  is  obtained  by  using  a  sequence  of  irre¬ 
ducible  unitary  representations  which  form  a  complete  orthogonal  system  on 
S0(3).  It  can  be  shown  that  a  continuous  density  function  on  S0(3)  can  be 
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approximated  b\  such  a  REFD  as  closely  as  we  wish  in  the  space  of  square  inte- 
grable  functions. 

(8)  Exponential  Fourier  Densities  and  Optimal  Estimation  for  Axial  Processes 

In  this  paper  we  consider  the  problem  of  estimating  axes  in  three-dimensional 

space.  An  axis  or  axial  vector  is  distinguished  from  a  polar  vector  in  that 

the  former  is  invariant  under  inversion.  Such  axes  occur  in  many  diverse  areas 

including  the  following:  geophysical  fluid  dynamics  to  estimate  the  vorticity 

of  a  flow,  paleomagnet ism  to  estimate  a  magnetic  field,  crystallography  to 

estimate  the  optic  axis  of  a  crystal,  geology  to  estimate  the  direction  of  a 

normal  to  the  axis  of  a  fold  in  a  layer  of  rock,  and  quantum  mechanics  to 

estimate  the  axis  of  rotation  of  a  rigid  body  rotation. 

Using  densities  of  the  form  exp  f  where  f  is  a  linear  combination  of 

axially  symmetric  spherical  harmonics,  estimation  problems  which  arise  by 

examining  various  possible  ways  of  obtaining  a  displacement  of  an  axis  will  be 

solved  in  this  paper.  Although  the  state  space  under  consideration  is  home- 

2  2 

omorphic  to  a  hemisphere  of  S  ,  the  results  for  estimation  on  S  cannot  be  applied 

for  several  important  reasons:  the  displacements  defined  in  that  paper  may 

result  in  a  given  point  being  displaced  to  a  non-ant ipodal  point  in  the  opposite 

2 

hemisphere,  the  densities  on  S  were  not,  in  general,  axially  symmetric,  and 

2 

the  error  criterion  used  for  S  is  undesirable  since  it  would  result  in  a 

rejection  of  the  antipode  of  the  optimal  estimate. 

Using  the  various  displacements  and  conditional  densities  obtained  in 

this  paper,  detection  for  axial  processes  would  be  described  bv  procedures 

2 

similar  to  those  used  for  S  and  S0(3). 

(9)  Estimation  and  Detection  on  Lie  Croups  -- 

There  are  five  main  sections  in  this  survey  chapter  to  be  included  in  the 
forthcoming  book.  Nonlinear  Filtering  and  Estimation  Theory--A  Status  Review, 
edited  by  E.  B.  Stear.  They  are  sunmarized  as  follows: 

(pa).  PROBABILITY  ON  THE  CIRCLE. 

There  are  many  fundamental  differences  between  the  estimation  and  detection 
problems  on  the  Euclidean  spaces  and  those  on  the  Lie  groups.  In  order  for  some 
readers  to  appreciate  them,  this  section  will  be  addressed  to  some  probabilistic 
elements  on  the  circle.  The  probability  distribution  function  and  the 


characteristic  function  on  the  circle  will  first  be  briefly  introduced. 

One  of  the  main  concerns  in  this  chapter  is  to  study  how  one  use6  the 

knowledge  of  the  probability  distribution  of  a  random  variable  taking  values 

on  a  Lie  group  to  determine  an  estimate  of  the  random  variable  that  minimizes 

a  certain  error  criterion.  The  conventional  least  squares  technique  cannot  be 

used  here.  Let  us  take  the  circle  as  an  example.  The  square  error  of  the  angles 

2 

O'  and  359°  is  (359~)c,  whereas  by  geometrical  intuition  they  are  only  lp  apart. 

In  the  sequel  we  will  look  into  this  issue  on  the  circle  in  detail. 

The  importance  of  the  normal  probability  densities  cannot  be  over¬ 
emphasized  for  estimation  and  detection  on  Euclidian  spaces.  Unfortunately, 
there  does  not  exist  an  analogous  density  on  the  circle  that  possesses  all 
the  nice  properties  of  the  normal  density.  In  fact,  the  nice  properties  of 
the  normal  density  are  almost  equally  divided  between  two  contenders  for 
normalcy,  the  folded  normal  density  and  the  circular  normal  density.  It  turns 
out  that  while  the  folded  normal  density  is  natural  to  use  for  continuous-time 
estimation,  the  circular  normal  density  is  more  suitable  for  discrete-time 
estimation.  They  will  both  be  discussed  and  compared  in  this  section. 

(Ob).  DISCRETE-TIME  ESTIMATION  ON  THE  CIRCLE. 

Estimation  for  discrete-time  systems  on  the  circle  was  studied  before, 
using  both  folded  normal  densities  and  Fourier  series  representations  of 
probability  densities.  The  optimal  estimation  equations  obtained  therein  are 
infinite-dimensional  and  cumbersome.  Although  some  numerical  simulation  has 
been  done  on  the  suboptimal  equations  obtained  from  truncating  the  higher 
order  terms,  it  is  not  clear  whether  these  equations  have  satisfactory  performance 
in  general. 

As  a  matter  of  fact,  the  "dimension"  of  the  optimal  estimation  equations 
derived  from  using  the  folded  normal  densities  increases  very  rapidly  in  time. 

When  the  Fourier  series  are  used  to  represent  probability  densities,  the 
application  of  Bayes'  rule,  which  involves  the  multiplication  of  two  a 
priori  densities,  has  the  effect  of  spreading  the  dominant  Fourier  coefficients 
into  the  higher  order  terms.  Obviously,  this  dilemma  becomes  compounded  in 
multistage  estimation  problem  when  a  sequence  of  multiplications  of  Fourier 
series  takes  place. 
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In  this  section,  we  will  present  an  alternative  approach.  The  approach 
is  based  on  a  new  class  of  probability  density  functions  which  have  the  fora 


n 

exp  E  (a,  cos  kx  +  b,  sin  kx) . 
k=0  k  k 

Such  a  density  will  be  called  an  exponential  density  of  order  n,  to  be  denoted 
by  EFD(n) . 


(9c).  CONTINUOUS -TIME  ESTIMATION  ON  THE  CIRCLE. 

A  signal  process  and  an  observation  process,  taking  values  on  s\  will  be 
formulated  in  terms  of  bilinear  Ito  matrix  differential  equations.  The  condi¬ 
tional  probability  distribution  of  the  signal,  given  observations  over  a  certain 
period  of  time,  will  be  evaluated.  Recursive  computational  schemes  for  optimal 
estimation  (filtering,  smoothing,  and  prediction),  with  respect  to  the  error 
criteria  defined  in  Subsection  II. 2,  will  be  derived.  In  fact  it  will  be  shown 
that  optimal  estimates  on  S^  can  be  obtained  recursively  by  the  use  of  an 
ordinary  vector  space  estimator  together  with  a  nonlinear  preprocessor  and  a 
nonlinear  postprocessor.  Multichannel  estimation  on  abelian  Lie  groups  will 
be  examined.  Examples  illustrating  the  optimal  estimation  procedure  are  given 
at  the  end  of  this  section. 

(9 d).  DISCRETE-TIME  ESTIMATION  ON  COMPACT  LIE  GROUPS. 

The  results  of  (2b)  can  be  easily  generalized  to  the  problems  on  compact 
Lie  groups  by  introducing  a  similar  exponential  Fourier  density  (EFD)  on  the 
group.  This  density  is  obtained  by  using  a  sequence  of  irreducible  unitary 
representations  which  form  a  complete  orthogonal  system  on  the  compact  group. 

It  can  be  shown  that  a  continuous  density  function  on  the  group  can  be 
approximated  by  such  an  EFD  as  closely  as  we  wish  in  the  space  of  square 
integrable  functions. 

As  in  the  circle  case,  the  class  of  ERD's  of  a  certain  finite  order  on 
the  compact  Lie  group  is  closed  under  the  operation  of  taking  conditional 
distributions  as  a  consequence  of  the  group  structure  of  the  group.  It  will 
become  clear  in  the  sequel  that  it  is  exactly  this  closure  property  of  the  EFD's 
that  yields  simple  estimation  schemes  which  update  the  sequential  conditional 
densities  by  recursively  revising  a  finite  and  fixed  number  of  parameters. 

In  order  to  illustrate  how  the  conditional  density  can  be  used  to 
calculate  the  optimal  estimate  on  the  group,  a  rigid  body  attitude  estimation 
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problem  in  solved  an  an  example.  The  error  criterion,  the  optimal  cellmate, 
ami  t  he  estimation  error  with  respect  to  the  criterion  will  he  derived  fot  a 
given  probability  dial rlhnl ton. 

(0e>.  OKTKirriON  FOK  lYWlNdOUS- T1NK  SYSTKMS  ON  UK  GROUPS. 

The  Idea  ot  "rolling  without  slipping"  introduced  In  Section  IV  will  now 
be  generalities  and  used  to  formulate  an  observation  process  on  an  arbitrary 
matrix  Me  group.  briefly  speaking,  we  will  inject  the  differentials  of  an 
observation  process  described  bv  a  vector  lto  differential  equation  Into  a 
hie  group  via  the  exponential  map  and  then  piece  them  together.  The  insulting 
product  integral  describes  our  observation  process  on  the  hie  group,  t he 
injected  vectot  observation  process  being  called  its  skew  form. 

The  observation  process  thus  constructed  on  a  l.le  group  will  be  seen  to 
satisfy  a  bilinear  matrix  stochastic  differential  equation,  when  its  skew 
form  is  linear.  The  observational  noise  can  be  viewed  as  entering  multiplica 
l lvelv . 

Given  an  arbitrary  bilinear  matrix  observation  process,  we  will  show  that 
the  corresponding  skew  observation  process  can  be  obtained  by  "reversing"  the 
above  injecting  procedure.  Furthermore,  these  two  procedures  will  be  seen  to 
induce  two  "almost  sure"  bijective  mappings  between  a  vector-valued  and  a  matrix¬ 
valued  function  spaces,  one  being  the  Inverse  of  the  other. 

It  Is  well  known  that  the  study  of  a  hie  group  mav  be  great Iv  simplified 
by  considering  the  tangent  space  fthe  hie  algebra!  of  the  hie  group  at  Its 
identity.  In  fact,  the  local  study  of  a  hie  group  is  entirely  equivalent  to 
the  study  of  the  finite  dimensional  liueat  algebraic  structures  ot  the  hie 
algebra.  In  this  paper,  the  above  bi ject ive  mappings  facilitate  similar 
simplification.  It  enables  us  to  evaluate  the  likelihood  latio  in  a  tinite 
dimensional  linear  space--the  hie  algebra! 

In  view  of  the  above  construction,  the  null  and  the  alternative  hypotheses 
that  the  signal  Is  respectively  absent  and  present  in  the  observation  on  a 
hie  group  can  be  written  as  a  pair  of  bilinear  matrix  stochastic  differential 
equations.  Using  the  bijectlve  mappings,  we  mav  transform  these  hypotheses  on 
a  hie  group  into  those  on  the  corresponding  hie  algebra.  There  the  likelihood 
ratio  can  be  expressed  by  the  well-known  Dun can's  formula.  Thus  the  evaluat tv'n 
of  the  likelihood  ratio  on  a  hie  group  also  hinges  on  the  least  -  squat es 
est imat ion. 


When  the  signal  is  n  linear  diffusion  process,  the  idea  of  using  (lie 
bijective  mappings  to  work  in  the  Lie  algebra  also  leads  to  a  finite  dimensional 
filtering  equation  for  evaluating  the  least -squares  estimate.  This  equation 
1 8  indeed  an  immediate  extension  of  the  Kalman-Bucy  filter  to  the  case  with 
observation  on  hie  groups. 

(10)  Estimation  Problems  witli  Lie  Group  Structure  -- 

The  exponential  Fourier  densities  were  used  to  study  estimation  on  the 
unit  circle,  the  unit  sphere,  the  three-dimensional  rotation  group,  and  the 
projective  two-space  in  a  sequence  of  recent  papers.  Many  finite-dimensional 
optimal  estimation  schemes  were  obtained  mainly  due  to  the  closure  property  of 
the  exponential  Fourier  densities  of  any  given  finite  order  under  the  operation 
of  taking  conditional  distributions.  Another  reason  for  using  exponential 
Fourier  densities  is  that  any  continuous  or  bounded-variation  probability  density 
on  the  aforementioned  spaces  can  be  very  closely  approximated  by  such  a  density. 

It  is  the  purpose  of  this  paper  to  generalise  the  previous  results  to  an 
arbitrary  compact  Lie  group  and  thereby  to  illustrate  that  it  is  the  structure 
of  a  compact  Lie  group  that  accounts  for  the  usefulness  of  the  exponential 
Fourier  densities. 

As  it  is  expected  that  most  readers  of  this  paper  will  be  engineers,  some 
definitions  and  preliminary  results  will  be  briefly  summarized  to 
facilitate  our  presentation  in  the  paper. 

(11)  Optimal  Filters  for  Nilpotent  Associate  Algebraic  Bilinear  Systems 

In  this  paper  least  squares  filtering  problem  is  considered  wherein  the 
signal  process  of  interest  is  generated  by  a  bilinear  dynamical  system  driven 
by  a  Gauss -Markov  process  while  the  observation  process  is  generated  by  adding 
white,  gaussian  noise  to  the  above  Gauss-Markov  process. 

Explicit  solution  to  the  above  nonlinear  estimation  problem  is  obtained 
when  the  matrix  algebra  associated  with  this  bilinear  equation  is  nilpotent; 
i.e.,  the  produce  of  more  than  a  certain  fixed  finite  number  of  matrices  in 
this  algebra  vanishes.  It  is  shown  that  the  resulting  filter  consists  of  a 
Kalman-Bucv  filter  followed  by  a  bilinear  (time-varying)  system  which  also 
possesses  the  nilpotency  property. 

Due  to  the  above  features  the  filter  is  seen  to  be  quite  suitable  for 
practical  realization.  On  the  other  hand  the  dimensionality  of  the  filter  is 
quite  high.  By  means  of  an  illustrative  example  it  is  shown  how  the  dimen- 
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sionality  may  be  reduced  by  eliminating  certain  Inherent  redundancies. 

The  motivation  for  this  paper  cotnes  from  the  belief  that  the  class  of  bili¬ 
near  signal  processes  considered  here  can  approximate  a  much  wider  class  of 
general  nonlinear  signal  processes. 

(12)  Optimal  Filters  for  Bilinear  Systems  with  Nilpotent  Lie  Algebras 

This  paper  discusses  a  least  squares  filtering  problem  in  which  the 
signal  process  to  be  estimated  is  generated  by  a  bilinear  dynamical  system. 

The  problem  is  illustrated  schematically  in  figure  1  below. 


signal 

x(t) 


Best  filtered 
Estimate 
of  Signal 

x(t|t). 


figure  1 

A  Block  Schematic  of  the  Nonlinear  Filtering  Problem 


An  exact  constructive  solution  to  the  above  problem  is  provided  under 

the  assumption  that  the  Lie  algebra  associated  with  the  bilinear  system  is 
nilpotent .  The  resulting  filter  turns  out  to  be  of  the  form  shown  in  figure  2. 


observations 
*(t) 


X  IIIIV  v  U  U  J.VIIO 

dv(t)£  dz(t)  -  £(tjt)dt 


figure  2 

Optimal  Filter  Structure 
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Moreover  the  bilinear  system  in  the  filter  structure  also  possesses  the 
nilpotency  property. 

The  dimensionality  of  the  above  filter  is  formidable  and  some  ways  of 
reducing  it  are  demonstrated  via  an  illustrative  example.  Finally  three 
important  special  cases  are  studied  in  which  the  bilinear  system  in  figure 
2  can  be  replaced  by  a  simple,  memory less  nonlinearity.  Possible  applica¬ 
tion  of  this  work  include  estimation  of  rotational  motion  of  rigid  bodies 
with  one  degree  of  freedom. 

(13)  Optimal  Estimation  for  the  Satellite  Attitude  Using  Star  Tracker 
Measurements 

This  paper  is  mainly  concerned  with  estimating  the  satellite  attitude 
given  the  gyro  readings  and  the  star  tracker  measurements  of  a  commonly  used 
satellite  attitude  measuring  unit  (SAMU).  The  SAMU  is  used  in  such  satellites 
as  the  high  energy  astronomy  observatory  (HEAO)  and  the  precision  pointing  control 
system  (PPCS).  It  is  composed  of  3  to  6  rate  gyros  and  2  star  trackers.  The 
satellite  attitude  is  propagated  over  a  certain  number  of  small  time  intervals 
by  integrating  the  satellite  angular  rates  determined  from  the  gyro  reading. 

Gyro  drift  rates,  misalignments,  and  lack  of  a  precise  initial  attitude 
reference  then  make  it  necessary  to  employ  two  gimbaled  star  trackers  to 
provide  a  bench  mark  to  the  further  propagation  of  the  satellite  attitude.  A 
star  tracker  utilizes  an  image  dissector  tube  to  locate  the  position  of  a 
star  on  its  photosensitive  surface.  Due  to  the  non-stationary  nonlinear 
characteristics  of  the  image  dissector  deflection  coils  and  the  white  noise 
from  the  processing  electronics,  it  is  at  this  stage  that  estimation  is 
required. 

A  new  representation  of  a  probability  density  of  a  three  dimensional 
rotation  called  the  exponential  Fourier  density  (EFD),  was  recently  introduced 
which  has  the  desirable  closure  property  under  the  operation  of  taking 
conditional  distributions.  Using  the  EFD's,  an  approach  was  suggested  to 
derive  recursive  formulas  for  updating  the  conditional  densities  of  a 
rotational  process  given  a  nonlinear  observation  in  additive  white  noise. 

In  this  paper,  this  approach  is  carried  out  for  the  aforementioned 
satellite  attitude  estimation  problem.  The  recursive  formulas  for  updating 
the  conditional  densities  of  the  satellite  attitude  are  derived  for  arbitrary 
star  tracker  equations.  These  general  formulas  are  included  here  to  accom- 


modate  possible  future  consideration  of  the  distortion  characteristics  of  the 
image  dissector  deflection  coils  and  possible  future  change  in  the  star  tracker 
configuration.  These  general  formulas  also  provide  a  basis  on  which  special 
cases  can  be  easily  analyzed.  However,  they  involve  a  large  amount  of 
computation.  Their  feasibility  for  on-board  implementation  is  highly 
questionable . 

In  a  conversation  with  E.  J.  Lefferts  of  the  GSFC/NASA,  it  was  observed 
by  him  that  by  choosing  appropriately  the  mathematical  description  of  the  star 
tracker  configuration,  the  star  tracker  measurement  can  be  expressed  in 
closed  form  as  a  linear  combination  of  the  rotational  harmonic  functions  of 
order  one.  This  observation  substantially  simplifies  the  optimal  estimation 
scheme  and  greatly  reduces  the  amount  of  computation  required  in  both  design¬ 
ing  and  utilizing  the  scheme.  A  detailed  derivation  of  the  associated  equations 
is  included  in  the  paper. 

A  close  look  at  the  mathematical  models  for  the  star  tracker  measurement 
revealed  that  the  measurement  models  are  not  observable.  The  nonobservability 
causes  a  pseudo-image  of  each  observed  star.  As  the  extended  K-B  filtering  is 
merely  a  local  processing,  it  does  not  pick  up  the  pseudo-image  and  is  therefore 
immune  from  its  effect.  In  contrast,  the  optimal  scheme  does  not  have  any 
"blind  spots"  and  thus  assigns  an  equal  probability  to  the  double  images  of 
each  of  the  observed  stars.  An  example  illustrating  such  nonobservability  is 
given. 

Fortunately  enough,  this  difficulty  resulting  from  the  nonobservability 
can  be  remedied  by  introducing  a  "pseudo-measurement"  of  the  second  apparent 
star  direction  cosine  ^(k)  with  respect  to  the  tracker  base  reference  axes. 

We  note  that  U2(k)  is  the  component  of  the  direction  vector  u(k)  that  is 
perpendicular  to  the  tracker  f ie ld-of-view  and  hence  cannot  be  measured 
directly  by  the  tracker.  However,  from  using  the  satellite  attitude  estimate 
s(t)  at  the  previous  step  t=k>*l,  Ujfk)  can  be  predicted,  which  is  to  be  used 
as  a  "measurement"  of  the  real  ^(k).  Facilitated  with  such  pseudo¬ 
measurement,  the  pseudo-image  of  the  observed  star  can  be  eliminated.  For 
want  of  a  mathematically  rigorous  proof,  only  a  heuristic  explanation  for 
this  pseudo-measurement  approach,  which  is  believed  to  be  new,  is  given  in 
the  paper. 

The  computation  required  to  produce  the  optimal  estimate.  Involves 
integrating  the  conditional  covariance  matrix  of  the  attitude  quaternion, 
which  is  very  CPU-time-consuming.  Encouraging  is  the  fast  and  high  con- 
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central  ion  of  the  conditional  probability  density  at  the  satellite  attitude 
under  estimation.  This  phenomenon  is  dictated  by  the  theory  and  suggests  two 
possible  ways  to  get  around  the  difficulty  of  integration.  One  way  is  to 
localize  the  integration.  Another  way  is  to  use  the  maximum  likelihood 
estimate  instead  of  the  optimal  estimate.  Roth  methods  were  researched  and 
implemented  on  the  computer.  The  simulation  results  Indicate  that  there  is 
virtually  no  difference  between  the  estimates  obtained  in  these  two  ways 
(at  least  for  the  models  used  in  this  paper). 

The  maximum  likelihood  estimator  avoids  not  only  integration  altogether 
but  also  the  task  of  computing  the  maximum  eigenvalue  and  its  eigenvector. 

All  it  needs  is  the  updated  Fourier  coefficients  of  the  conditional  density, 
which  are  obtained  through  simple  algebraic  formulas.  Therefore,  the  maximum 
likelihood  estimator  is  used  in  comparison  with  the  extended  K-B  filter,  a 
standard  method  for  spacecraft  attitvide  estimation. 

A  comparison  between  the  K-B  filtering  and  our  maximum  likelihood 
estimator  was  conducted  by  E.  J.  Lefferts  of  the  GSFC/NASA.  A.  N.  Mansfield 
of  the  CSTA  generated  a  sequence  of  33  star  tracker  observations.  The  average 
body  rates  were  provided  every  one-third  of  a  second,  and  the  tracker  ob¬ 
servation  was  taken  every  two  minutes.  The  standard  deviation  of  the  tracker 
measurement  noises  is  20  arcseconds.  For  such  a  low  noise  level,  it  is 
known  that  linearization  is  a  very  good  approximation.  Therefore,  it  is  not 
surprising  that  the  maximum  likelihood  estimator  is  not  much  better  than  the 
K-B  filter. 

However,  the  comparison  results  indicate  that  the  maximum  likelihood 
estimator  is  almost  always  better  and  converges  faster  than  the  K-B  filter. 

It  is  also  noted  that:  (1)  The  simulated  measurement  data  are  in  strict 
accord  with  the  system  model,  assuming  all  the  true  values  of  the  model 
parameters  and  the  noise  statistics  are  given.  (2)  There  is  no  random  driving 
term  in  the  state  dynamics  model  for  the  spacecraft  attitude.  Under  these 
two  conditions  in  addition  to  the  low  measurement  noise  level  mentioned  above, 
even  simple-minded  estimators  can  be  expected  to  perform  near  optimal.  But 
these  conditions  are  usually  far  from  being  met  in  reality. 

The  simulated  examples  in  Section  IX  were  chosen  to  test  the  robustness 
of  our  new  schemes  and  represent  tougher  working  conditions  than  the  real  ones. 
The  simulation  results  to  be  depicted  in  graphs  indicate  that  both  the  local 
integration  estimator  and  the  maximum  likelihood  estimator  track  the  signal 
nicely. 


The  robustness  of  an  estimator  toward  uncertainty  of  system  parameters 
and  the  random  driving  term  in  the  state  dynamics  is  perhaps  the  most  important 
consideration  in  real  world  application.  While  there  is  every  reason  to 
believe  that  the  maximum  likelihood  estimator  is  superior  in  this  regard, 
the  issue  remains  to  be  resolved  in  the  future. 


(14)  Convolution  of  Exponential  Fourier  Densities  and  Filtering  on  the  Circle 

Whi le  the  EFD(n)'s  are  closed  under  conditioning,  they  are  not  closed  under 
convolution.  This  deficiency  has  prevented  us  from  including  random  driving 
terms  in  the  signal  processes  in  our  results  using  EFD(n)'s.  In  this  paper, 
we  will  present  a  striking  property  of  the  EFD(n)'s;  namely,  they  are  almost 
closed  under  convolution. 

The  maximum  informational  distance,  in  the  sense  of  Kullback,  between  the 
convolution  of  two  EFD(l)'s  and  its  best  fit  by  an  EFD(l)  is  numerically  calcu¬ 
lated  and  is  0.00539412984011850.  Such  a  small  number  indicates  that  the 
convolution  of  any  two  EFD(l)'s  is  virtually  an  EFD(l).  Hence  it  is  not  sur¬ 
prising  that  replacing  the  convolution  of  two  a  priori  EFD(l)'s  with  its  best 
fit  EFD(l)  yields  a  near  optimal  estimate,  which  is  almost  indistinguishable 
from  the  optimal  one.  All  these  numerical  results  are  reported  in  the  paper. 

The  case  of  EFD(2)’s  will  also  be  thoroughly  studied  in  the  paper.  More 
details  will  be  submitted  to  AFOSR  as  soon  as  available. 
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